Measures of privacy in randomized response surveys for quantitative stigmatizing variables

نویسنده

  • Mausumi Bose
چکیده

In many socio-economic surveys, the variable of interest is sensitive or stigmatizing. Examples include tax evasion, criminal conviction, alcohol expenses, induced abortion, drug abuse, etc. In such situations, the technique of randomized response is very useful as it does not require the respondent to reveal his/her true value. The issue of privacy protection is important in this context and a few researchers have studied this problem for surveys on dichotomous populations, where the objective is to estimate the proportion of persons bearing the sensitive trait. There is a considerable literature on various randomized response techniques. However, not much is known as yet about the extent of privacy protection when the variable under study is quantitative in nature. In this article we study the issue of privacy protection when the randomized response technique is used for a quantitative variable which could be either discrete or continuous. We propose a measure of privacy for each case. Taking cognizance of a conflict between protection of privacy and enhancing estimation efficiency, we discuss how, given a stipulated level of our privacy measure, the parameters of the randomization device can be determined so as to maximize the efficiency of estimation. Numerical examples are provided to illustrate our results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Measuring and explaining tax evasion: Improving self-reports using the crosswise model

Due to its sensitive nature, tax compliance is difficult to study empirically, and valid information on tax evasion is rare. More specifically, when directly asked on surveys, respondents are likely to underreport their evasion behavior. Such invalid responses not only bias prevalence estimates but may also obscure associations with individual predictors. To generate more valid estimates of tax...

متن کامل

Estimation of Sensitive Proportion by Randomized Response Data in Successive Sampling

This paper considers the problem of estimation for binomial proportions of sensitive or stigmatizing attributes in the population of interest. Randomized response techniques are suggested for protecting the privacy of respondents and reducing the response bias while eliciting information on sensitive attributes. In many sensitive question surveys, the same population is often sampled repeatedly...

متن کامل

Impact of different privacy conditions and incentives on survey response rate, participant representativeness, and disclosure of sensitive information: a randomized controlled trial

BACKGROUND Anonymous survey methods appear to promote greater disclosure of sensitive or stigmatizing information compared to non-anonymous methods. Higher disclosure rates have traditionally been interpreted as being more accurate than lower rates. We examined the impact of 3 increasingly private mailed survey conditions-ranging from potentially identifiable to completely anonymous-on survey r...

متن کامل

On Estimation of Mean and Sensitivity Level of a Stigmatizing Variable Using Two Stage Scrambling

Procuring reliable data on sensitive/stigmatizing variables is a tricky issue in many social surveys. When questioned through direct questioning respondents may report evasive answers on such sensitive variables. Consequently, a bias creeps into the estimates based on evasive answers. Different indirect questioning methods such as Item Count Technique, Three Cards Method and Randomized Response...

متن کامل

Privacy Preserving Categorical Data Analysis with Unknown Distortion Parameters

Randomized Response techniques have been investigated in privacy preserving categorical data analysis. However, the released distortion parameters can be exploited by attackers to breach privacy. In this paper, we investigate whether data mining or statistical analysis tasks can still be conducted on randomized data when distortion parameters are not disclosed to data miners. We first examine h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013